Detecting protein dissimilarities in multiple alignments using Bayesian variable selection
نویسندگان
چکیده
MOTIVATION We present an application of Bayesian variable selection to the novel detection of sequence elements that confer negative design to protein structure and function. As an illustration, we analyze the different dimer interfaces between the CXCL8 chemokine family with the CCL4 and CCL2 chemokine families to discover the changes that disfavor CXCL8 of quaternary structure. RESULTS In comparison with known experimental results, our method identifies evolutionarily conserved sequence changes in the CC families that inhibit CXCL8 quaternary structure. Therefore, we find positive selection of negative design elements. Furthermore, our approach predicts that a two-residue deletion conserved in the CCL4 chemokine family disfavors CXCL8 dimerization. AVAILABILITY The Matlab code for the Bayesian variable selection is freely available at http://stat.tamu.edu/~mvannucci/webpages/codes.html
منابع مشابه
Erasing errors due to alignment ambiguity when estimating positive selection.
Current estimates of diversifying positive selection rely on first having an accurate multiple sequence alignment. Simulation studies have shown that under biologically plausible conditions, relying on a single estimate of the alignment from commonly used alignment software can lead to unacceptably high false-positive rates in detecting diversifying positive selection. We present a novel statis...
متن کاملDetecting Selection in DNA Sequences: Bayesian Modelling and Inference
Recent developments in Bayesian modelling of DNA sequence data for detecting natural selection at the amino acid level are presented. This article summarizes and discusses empirical model-based approaches. Key features of the modelling framework include the incorporation of biologically meaningful information via structured priors, posterior detection of sites under selection, and model validat...
متن کاملTOPALi v2: a rich graphical interface for evolutionary analyses of multiple alignments on HPC clusters and multi-core desktops
UNLABELLED TOPALi v2 simplifies and automates the use of several methods for the evolutionary analysis of multiple sequence alignments. Jobs are submitted from a Java graphical user interface as TOPALi web services to either run remotely on high-performance computing clusters or locally (with multiple cores supported). Methods available include model selection and phylogenetic tree estimation u...
متن کاملAddressing the shortcomings of three recent Bayesian methods for detecting interspecific recombination in DNA sequence alignments.
We address a potential shortcoming of three probabilistic models for detecting interspecific recombination in DNA sequence alignments: the multiple change-point model (MCP) of Suchard et al. (2003), the dual multiple change-point model (DMCP) of Minin et al. (2005), and the phylogenetic factorial hidden Markov model (PFHMM) of Husmeier (2005). These models are based on the Bayesian paradigm, wh...
متن کاملBayesian Sample Size Determination for Joint Modeling of Longitudinal Measurements and Survival Data
A longitudinal study refers to collection of a response variable and possibly some explanatory variables at multiple follow-up times. In many clinical studies with longitudinal measurements, the response variable, for each patient is collected as long as an event of interest, which considered as clinical end point, occurs. Joint modeling of continuous longitudinal measurements and survival time...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 23 2 شماره
صفحات -
تاریخ انتشار 2007